Estimating Distribution Functions from Survey Data Using Nonparametric Regression
نویسنده
چکیده
Survey sampling often supplies information about a study variable only for sampled elements. However, auxiliary information is often available for the entire population. The relationship of the auxiliary information with the study variable across the sample allows inferences about the nonsampled portion of the population. Thus, auxiliary information can be used to improve upon survey estimation. In particular, finite population distribution function estimation can be improved. Existing parametric estimators incorporate auxiliary information by assuming it to have a linear relationship with the study variable, which is often unreasonable or unverifiable in survey sampling. A model-assisted nonparametric estimator based on local polynomial regression is introduced which removes these parametric restrictions by working under a more generic context. A Monte Carlo comparison of this estimator with parametric estimators demonstrates its superior efficiency for estimation of the distribution function and quantiles in most cases in which the parametric methods misspecify the relationship between the auxiliary and study variables. Finally, a semiparametric estimator is introduced which allows for the incorporation of parametric fixed effects, including categorical variables. When applied to a population of Northeastern lakes, this estimator is more efficient on average than a more conventional estimator that does not incorporate any auxiliary information.
منابع مشابه
Jump Regression Analysis
Nonparametric regression analysis provides statistical tools for estimating regression curves or surfaces from noisy data. Conventional nonparametric regression procedures, however, are only appropriate for estimating continuous regression functions. When the underlying regression function has jumps, functions estimated by the conventional procedures are not statistically consistent at the jump...
متن کاملTitle of Dissertation : Nonparametric Quasi - likelihood in Longitudinal Data
Title of Dissertation: Nonparametric Quasi-likelihood in Longitudinal Data Analysis Xiaoping Jiang, Doctor of Philosophy, 2004 Dissertation directed by: Professor Paul J. Smith Statistics Program Department of Mathematics This dissertation proposes a nonparametric quasi-likelihood approach to estimate regression coefficients in the class of generalized linear regression models for longitudinal ...
متن کاملLocal linear regression for generalized linear models with missing data
Fan, Heckman and Wand (1995) proposed locally weighted kernel polynomial regression methods for generalized linear models and quasilikelihood functions. When the covariate variables are missing at random, we propose a weighted estimator based on the inverse selection probability weights. Distribution theory is derived when the selection probabilities are estimated nonparametrically. We show tha...
متن کاملEstimating Nonparametric Random Utility Models with an Application to the Value of Time in Heterogeneous Populations
The estimation of random parameters by means of mixed logit models is becoming current practice amongst discrete choice analysts, one of the most straightforward applications being the derivation of willingness to pay distribution over a heterogeneous population. In numerous practical cases, parametric distributions are a priori specified and the parameters for these distributions are estimated...
متن کاملEstimating the error distribution in nonparametric multiple regression with applications to model testing
In this paper we consider the estimation of the error distribution in a heteroscedastic nonparametric regression model with multivariate covariates. As estimator we consider the empirical distribution function of residuals, which are obtained from multivariate local polynomial fits of the regression and variance functions, respectively. Weak convergence of the empirical residual process to a Ga...
متن کامل